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A DATA SEQUENCING METHOD TO IMPROVE TRANSMISSION OF SELF- 
SIMILAR DATA IN A MULTI-NODE NETWORK 

BACKGROUND OF THE INVENTION 

[0001] The present invention concerns sequencing of data in a multi-node 
communications network, in particular a data sequencing method to improve 
transmission of self-similar data through the network and reduce buffer expenditures 
within the network. 

[0002] Broadband local area network (LAN) traffic, bursty variable bit-rate 
(VBR) video traffic and wide-area network (WAN) traffic, such as on the Internet, have 
been shown to have self-similar, fractal, properties. Self-similar traffic behaves very 
differently than voice traffic or traffic predicted by packet traffic models, which 
typically conform to Poisson-like distributions. In a Poisson-like packet traffic model, 
aggregate traffic become smoother (i.e. less prone to bursts of data) as the number of 
traffic sources increases. The self-similar nature of LAN and VBR video traffic, on the 
other hand, cause such traffic to manifest a self-similar burstiness at every time scale. 
Bursts, consisting of bursty sub-periods separated by less bursty sub-periods, are seen in 
the traffic on a millisecond time scale, as well as on an hour time scale. This self- 
similar, fractal, pattern imparts a long-range dependence (LRD) in the traffic. The 
burstiness of the composite of such traffic tends to increase, rather than decrease, as the 
number of traffic sources increase. 

[0003] Self-similar burstiness may lead to a reduction in network performance. 
Transmission nodes within the network can only handle a certain data rate, which is 
often exceeded at the height of a burst, leading to congestion and data loss. One 
solution is to reduce the amount data travelling through the network, thus decreasing the 
overflow frequency of bursts. This is not a desirable solution, however, as it results in 
the underutilization of network resources. Merely increasing the buffer size in the 
network may not relieve the problem as the burstiness of the data exhibits itself on 
several time scales. Greater buffer size may, however, increase the delay for traffic 
passing through the network. 



MATI-201US 

-2- 

[0004] Networks that may be affected by long range dependency include 
broadband ISDN networks and Internet networks. This effect of long-range dependency 
is discussed in an article by W. Leland et al. entitled " On the Self-Similar Nature of 
Ethernet Traffic," Proc. ACMSigcomm '93, San Francisco, 1993, pp. 183-193 and in an 
5 article by A. Erramili et al. entitled " Experimental Queuing Analysis with Long-Range 
Dependent Packet Traffic," IEEE/ACM Transactions on Networking vol. 4, no. 2, 1996. 

[0005] One solution for handling data that exhibits long range dependency is to 
incorporate a queuing buffer at the source node to store excess data during bursts for 
later transmission during intervening lower traffic periods, effectively smoothing the 

10 data bursts. This method is presently in use, especially for VBR video data. As data 

comes into a transmission node, it enters the back of the queuing buffer. Meanwhile, the 
oldest data in the queuing buffer is taken off of the front of the buffer at a predetermined 
rate and transmitted. The remaining data in the buffer is moved forward. This system 
smoothes the data traffic across transmission nodes and allows the network to operate at 

1 5 a high efficiency if the full bandwidth of the network can be filled with similar fixed- 
bandwidth components. The amount of data waiting in the buffer fluctuates with the 
incoming traffic level, absorbing the burstiness of the stream. To effectively handle 
LRD data (i.e. maintaining a low cell loss rate), the smoothing buffer must be large. 
Larger buffers, however, cause increased delay that may not be acceptable for some 

20 applications. 

SUMMARY OF THE INVENTION 

[0006] The present invention is embodied in a scheduling method for data in a 
digital communication network that improves transmission by breaking up long-range 
dependencies in the data while preserving local order. The scheduling method reorders 

25 individual blocks of data in a macro-block while preserving the order of the data in each 
block. An exemplary embodiment of this method contains several steps. The first step 
is to define the data in terms of macro-blocks, each macro-block including a plurality of 
blocks. The blocks of a macro-block are reordered to substantially eliminate the long- 
range dependence of the data in the macro-block. The reordered blocks are then 

30 transmitted through the network, and reordered again at the other side to recreate the 
macro-block. 



MATI-201US 



-3- 

[0007] According to one aspect of the invention, a digital communications 
network includes a shuffling buffer at an input node to shuffle blocks of the data and 
thereby substantially eliminate long-range dependence in the data. It also includes at 
least one transmission node through which the shuffled data is sent, and a reconstruction 
5 buffer at a client node to reconstruct the shuffled data into its original order. 

BRIEF DESCRIPTION OF FIGURES 

[0008] Figures la, lb, lc, Id, and le are graphs illustrating self-similarity of data 
traffic in a digital communication network. 

[0009] Figure 2a is a semi-log graph illustrating a heavy tail characteristic of self- 
l o similar data traffic. 

[0010] Figure 2b is a semi-log graph illustrating Poisson-like data traffic. 

[0011] Figure 3 is a block diagram of an exemplary multi-node digital 
communication network of the present invention. 

[0012] Figure 4 is a drawing illustrating an exemplary shuffling of sub-blocks 
15 according to the present invention. 

[0013] Figure 5 is a flowchart illustrating an exemplary method of the present 
invention. 

[0014] Figure 6 is a semi-log graph showing the results of experiments carried out 
by the inventors of an exemplary shuffling method. 

20 DETAILED DESCRIPTION OF THE INVENTION 

[0015] One exemplary embodiment of the present invention uses shuffling 
techniques to aid the flow of traffic through a computer network. An exemplary 
deterministic shuffling is to introduce skips of a certain number of blocks of data to be 
transmitted through the network. For example, one may reorder nine blocks, originally 
25 sequentially numbered 1-9, with skips of three segments: 1, 4, 7, 2, 5, 8, 3, 6, 9. In this 
example, the nine blocks form a macro-block. Once a certain number of skips are made, 
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in this example two, the order jumps back to the first remaining block. This number is 
known as the restart number. Though simple, a shuffling of this sort substantially 
reduces the long-range dependency, and, if the skips are large enough, can substantially 
eliminate LRD behavior, at least up to a certain time range. After one macro-block has 

5 been transmitted, the same procedure is applied to the remaining macro-blocks until all 
of the data in the data stream has been transmitted. Although the exemplary 
embodiment of the invention is described in terms of a block-shuffling algorithm that 
uses skips and restarts, it is contemplated that it may be practiced with many different 
data resequencing operations that change the order of blocks of data according to a 

10 predetermined deterministic scheme. 

[0016] The present invention is described in terms of blocks of macro-blocks. A 
macro-block may be as large as the entire data set and a block may be as small as a 
single frame. Each block includes multiple data values and the order of the data in each 
blocks is maintained even when the blocks are reordered. While the invention is 
15 described in terms of reordering blocks in macro-blocks, it is contemplated that, in 
addition, the macro-blocks themselves may be reordered. 

[0017] Figures la-e show exemplary traffic levels through a node of digital 
communications network transmitting LAN data. The graph in Figure la contains time 
series 100, which shows the number of packets transmitted per 100 second interval. 

20 Also highlighted in Figure la are time series 102, a randomly selected tenth of time 
series 100, and time series 104, a randomly selected tenth of time series 102. Each of 
graphs in Figures lb-e contain progressively more finely resolved portions of the time 
series in the preceding graph(s). An important feature of these graphs is the similar 
levels of burstiness shown over all of these time scales. This is an example of the self- 

25 similarity seen in LAN and VBR data. 

[0018] Figures 2a and b are semi-log graphs of illustrating the logarithm of the 
overflow frequency of a queuing buffer versus the buffer size for an exemplary level of 
traffic in the network. Figure 2a represents the overflow frequency for self-similar 
traffic, which exhibits long-range dependencies. Heavy tail 200 is indicated, showing 
30 that, for LRD data, there is only a small improvement in loss rate when the buffer size is 
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increased, at least over a certain range of buffer size values. Figure 2b represents the 
overflow frequency of Poisson-like, or random, traffic flow. 

[0019] Figure 3 shows an exemplary digital communications network 300 of the 
present invention. Data traffic from server 302 enters shuffling buffer 306 where blocks 

5 of data are detenriinistically reordered. The reordered blocks of data are then 

transmitted through queuing buffers 310 of node(s) 312 of the digital communications 
network and reordered again by reconstruction buffer 308 to reconstruct the original 
data. Client 304 may then receive the data which they desire from the traffic. The 
subject invention deterministically reorders blocks of data in a macro-block to preserve 

10 local order (i.e. the order of the data in each block) and, thus, avoid a complete 

randomization of the data. Randomization of bursty data may not be beneficial if the 
random reordering clusters high-data rate intervals. To understand how this may occur, 
consider the transmission of a VBR video stream that has been encoded according to the 
standard specified by the Moving Pictures Experts Group (MPEG). This video stream 

15 includes infra-frame coded images (I-frames), predictively encoded images (P-frames) 
and bidirectionally encoded images (B-frames). I-frames typically include more data 
than P-frames which typically include more data than B-frames. If the block unit is 
chosen to be an image frame, and the macro-block unit a predetermined number of 
frames (e.g. 1024 frames), then a random reordering of the frames may group I-frames 

20 next to each other, resulting in a burst of data that is too large for the network buffers to 
handle. Data in one of the I-frames would then be lost, severely degrading the video 
image reproduced from the reordered frames. The terms " block" and " macro-block" 
as used in the present invention are not to be confused with the terms of the same name 
used in the MPEG standard. 

25 [0020] In addition to the advantage of reduced overflow frequency in the 

exemplary multi-node digital communications network of the present invention shown 
in Figure 3, such a network may have the advantage of reduced memory use. In a multi- 
node digital communications network relying only on queuing buffers to smooth traffic, 
there is a queuing buffer 310 at each transit node 312, with a large memory, Qq. In the 

30 exemplary multi-node digital communications network of the present invention shown 
in Figure 3, there is a shuffling buffer, with memory, S mem , and a reconstruction buffer, 
with memory, but, because the reshuffling operation has reduced long-range 
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dependency in the data, each of the network nodes uses a smaller amount of memory, 
Ql, in the queuing buffer. Equation (1) shows the net memory saved, M, for the 
exemplary multi-node digital communications network of the present invention shown 
in Figure 3 with N nodes. 

5 M^CQo-QirN-lS^ + R^J (1) 

[0021] For a particular data transmission, it may be better to consider the net 
delay through a network having H+l hops, that is to say the transmission passes through 
H transit nodes in its path from the input end-node to the output end-node. In a multi- 
node digital communications network relying on queuing buffers to smooth traffic, there 

10 is a buffer delay at each node, Q t . If LRD data is transmitted through the network, the 
delay is Qot If the data is preprocessed to reduce LRD, the delay is Qit- In the 
exemplary multi-node digital communications network of the present invention shown 
in Figure 3, there is a shuffling buffer delay, S t , and a reconstruction buffer delay, 
Equation (2) shows the time saved, T, for transmission of data through the exemplary 

15 multi-node digital communications network of the present invention shown in Figure 3 
where the data passes through H transit nodes. 

T = (Q t0 -Qti)*H-(S t + R t ) (2) 

[0022] Figure 4 illustrates data traffic being reordered in a shuffling buffer by an 
exemplary method, discussed above to create a shuffled time series, using skips of S 

20 blocks and a restart number of R. The exemplary block numbers 400 in the lower part 
of the blocks and the order numbers 402 illustrate the case where S = 3 and R = 2. 
Thus, the exemplary shuffling algorithm sends a block, skips three blocks to send the 
next block, skips three blocks again to send the next block and then restarts at the block 
next to the first block that was sent. Thus, the skip number, S, refers to the number of 

25 blocks that are skipped between successive transmissions and the reset number, R, refers 
to the number of skips after which the position of the transmission is reset to the block 
next to the block that was sent at the beginning of the last skip cycle. As an example, 
the block numbers 400 in Figure 4 are the order in which the original blocks were 
received and order number 402 is the order in which the shuffled blocks are transmitted 

30 through the network using this exemplary shuffling technique to reorder the data. 
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[0023] Thus, the number of blocks, Nmb, in a macro-block that is used in a 
shuffling scheme having a skip number S and a restart number R, is given by equation 

(3). 

Nmb = s (R+i) (3) 

5 [0024] Figure 5 is a flowchart of an exemplary method of the present invention to 
remove long-range dependencies and thereby decrease overflow frequency in a digital 
communications network. The method begins with step 500, by defining the data as 
macro-blocks of data blocks. For a VBR MPEG video sequence, for example, the 
blocks may be picture records and the macro-blocks may be groups of 1024 picture 

10 records. 

[0025] Next, step 502 selects a first macro-block and, at step 504 the blocks in 
this macro-block are deterministically reordered. In the exemplary embodiment, the 
MPEG video picture records are shuffled with a skip number of 32 and a restart number 
of 32. The degree to which the long-range dependence, and other deterministic 
15 characteristics, of the data are eliminated depends both upon the shuffling method 
selected and the parameters of that shuffling method. 

[0026] An exemplary embodiment of the present invention uses the method of 
skips and restarts for creation of reordered data sets, as described above with reference 
to Figure 4. Although the example shows the use of skips of three segments (blocks) 

20 and restarting at the next value after two skips, larger numbers of the skipped segments 
(skip numbers) and larger restart numbers to produce greater reductions in the long- 
range dependency of the original data traffic. Larger skip numbers and larger restart 
numbers, however, also increase the size of the shuffling and reconstruction buffers. 
The effectiveness of this exemplary shuffling method for creating data sequences with 

25 reduced LRD also depends on the block size, which may affect the complexity of the 
shuffling circuitry. 



[0027] The three steps, 500, 502, and 504, represent an initial delay in 
transmitting a data stream. In one exemplary embodiment of the invention, an entire 
macro-block is resequenced and stored in the shuffling buffer before transmission 
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begins. In another embodiment, only a portion of the macro-block may be stored into 
the reshuffling buffer before transmission begins, with the remainder of the macro-block 
being shuffled and stored as the earlier shuffled blocks are transmitted. In a third 
embodiment of the invention, the shuffling buffer may not perform any shuffling 
5 operation but, instead, may be indexed using the skip and restart values to dynamically 
reorder the blocks. 

[0028] In one exemplary embodiment of the invention, where the data to be 
transmitted is a VBR MPEG bit-stream, the step 500 may search the bit-stream for 
picture headers and store the data associated with each picture header into a separate cell 
10 of the reshuffling buffer. To accommodate I-frames as well as P-frames and B-frames, 
each cell holds an amount of data that is at least equal to the largest amount of data that 
may be generated for an I-frame. 

[0029] As each block of data from the shuffling buffer is transmitted in step 506, 
a corresponding block from the next macro-block is reordered and inserted in place of 

15 the data that was just transmitted at step 508. This exemplary method overlaps the 
reordering of the data in the next macro-block with the transmission of data from the 
current macro-block. After step 508, control transfers to step 506 to transmit the next 
block. Because the shuffling of each macro-block after the first macro-block overlaps 
the transmission of the first macro-block it does not represent any additional delay for 

20 the system. 

[0030] When the reordered blocks are transmitted in step 506, they pass through 
at least one node 310 of the network 300. Once the reordered blocks have been received 
at the destination node, they are reordered again by the reconstruction buffer to recreate 
the original data traffic, step 510. As with steps 500, 502 and 504, the reordering of the 
25 data in a macro-block may overlap the receipt of the data. Alternatively, the data may 
be stored directly into a reshuffling buffer using, for example, skip and restart numbers 
that are transmitted with the data, so that the blocks of data may be stored directly into 
their final positions in the macro-block as they are received at the receiving end-node. 

[0031] Figure 6 is a semi-log graph showing the results of experiments carried out 
30 by the inventors using the exemplary shuffling method of skips and restarts previously 
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described with regard to Figure 4. All of the curves show the logarithm of the overflow 
frequency as a function of the queuing buffer size in an exemplary digital 
communications network for VBR video data. The block size chosen was 300 ms. 
Curve 600 illustrates exemplary data for a reordering of the VBR video data with a skip 
5 number of 2 and a restart number of 2 . The skip numbers and restart numbers for the 
remaining curves, 602, 604, 606, 608, and 610, are shown in the Table. 



TABLE 



Curve # 


Skip# 


Restart # 


602 


4 


4 


604 


8 


8 


606 


16 


16 


608 


32 


32 


610 


64 


64 



[0032] Although the blocks shown in the Table are defined in terms of an amount 
10 of displayed video time represented by each block, it is contemplated that other 
measures may be used to form blocks of data. Each block may, for example, be a 
predetermined number of bytes. If this embodiment is used with a VBR video stream, 
each block may represent a respectively different amount of video display time. 

[0033] Although the embodiments of the invention described above have been in 
15 terms of a multi-node digital communication network, it is contemplated that similar 
concepts may be practiced within other networks. Additionally, it is contemplated that 
the functions of the shuffling and reconstruction buffers previously described may be 
carried out within a general purpose computer system instructed to perform these 
functions by means of a computer-readable carrier. Such computer-readable carriers 
20 include, for example, magnetic and optical storage media, as well as audio-frequency, 
radio frequency, and optical carrier waves. Also, it will be understood to one skilled in 
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the art that a number of other modifications exist which do not deviate from the scope of 
the present invention as defined by the appended claims. 



